Unsupervised Ontology Acquisition from Plain Texts: The OntoGain System

Authors

  • Efthymios Drymonas
  • Kalliopi Zervanou
  • Euripides G. M. Petrakis
Abstract

We propose OntoGain, a system for unsupervised ontology acquisition from unstructured text that relies on multi-word term extraction. To acquire taxonomic relations, we exploit the lexical information inherent in multi-word terms, comparing two methods: agglomerative hierarchical clustering and formal concept analysis. To detect non-taxonomic relations, we compare an association-rule-based algorithm and a probabilistic algorithm within OntoGain. OntoGain can transform the derived ontology into standard OWL statements. Its results are compared both to hand-crafted ontologies and to a state-of-the-art system in two domains: medicine and computer science.
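
As a rough illustration of how agglomerative hierarchical clustering can group multi-word terms by their lexical content, the sketch below merges terms that share words. It is not OntoGain's implementation: the Jaccard word-overlap measure, the average-link merge rule, the `threshold` value, and all function names are assumptions chosen for brevity.

```python
from itertools import combinations


def jaccard(a, b):
    """Word-overlap similarity between two sets of words."""
    return len(a & b) / len(a | b)


def cluster_similarity(c1, c2):
    """Average-link similarity: mean pairwise Jaccard between members."""
    pairs = [(t1, t2) for t1 in c1 for t2 in c2]
    total = sum(jaccard(set(t1.split()), set(t2.split())) for t1, t2 in pairs)
    return total / len(pairs)


def agglomerate(terms, threshold=0.3):
    """Repeatedly merge the most similar pair of clusters.

    Stops when no pair of clusters reaches `threshold` (a hypothetical
    cut-off, not a value taken from the paper).
    """
    clusters = [(t,) for t in terms]  # start with one cluster per term
    merges = []                       # record of (left, right, similarity)
    while len(clusters) > 1:
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda ij: cluster_similarity(clusters[ij[0]], clusters[ij[1]]),
        )
        best = cluster_similarity(clusters[i], clusters[j])
        if best < threshold:
            break
        merges.append((clusters[i], clusters[j], best))
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return clusters, merges


if __name__ == "__main__":
    terms = [
        "neural network",
        "recurrent neural network",
        "hash table",
        "distributed hash table",
    ]
    clusters, merges = agglomerate(terms)
    for cluster in clusters:
        print(cluster)
```

The recorded merge order gives a simple dendrogram; in a taxonomy-building setting, each merge could be read as a candidate parent concept covering the merged terms.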

Similar resources

Effective Ontology Learning: Concepts' Hierarchy Building using Plain Text Wikipedia

Ontologies stand at the heart of the Semantic Web. Nevertheless, engineering heavyweight or formal ontologies is commonly judged to be a tough exercise that requires time and heavy costs. Ontology Learning is thus a solution for this exigency and an approach to the ‘knowledge acquisition bottleneck’. Since texts are massively available everywhere, making up of experts’ knowledge and th...

Design of an Intelligent Ontology Construction System Using the ART Neural Network and the C-value Method

In recent years, many efforts have been made to design ontology learning methods and automate the ontology construction process. Ontology construction is a time-consuming and costly procedure for almost all domains and applications, so automating it is a way to overcome the knowledge acquisition bottleneck in information systems and reduce the construction cost. In this artic...

The GENIA Project: Knowledge Acquisition from Biology Texts

Overview of Project: The GENIA project [9] (Fig. 1) seeks to automatically extract useful information from texts written by scientists to help overcome the problems caused by information overload. We intend that while the methods are customized for application in the microbiology domain, the basic methods should be generalisable to knowledge acquisition in other scientific and engineering domain...

Automatic Rule Retrieval from Websites Using Ontology and Text Mining

A rule-based system, such as an intelligent service-comparison portal, may compare product prices, shipping options, refund options, etc. Such a rule-based system requires an automatic procedure for acquiring knowledge from the Web, which consists of unstructured texts. Knowledge acquisition can be carried out by ontology acquisition and rule acquisition. Obtaining information such as product prices from w...

Acquisition of Semantic Knowledge using Machine learning methods: The System

In this paper we describe the ML system ASIUM, which acquires semantic knowledge from parsed technical texts. ASIUM is devoted to the acquisition of case frames and ontologies. Applications requiring case frames and ontologies are numerous. The Dassault Aviation company, with which we are collaborating, is mainly interested in controlling the semantics of specification texts, in terminology acquisition for s...

Publication date: 2010